We give quantum circuits that simulate an arbitrary two-qubit unitary operator up to global phase. For several 
quantum gate libraries we prove that gate counts are optimal in worst and average cases. Our lower and upper 
bounds compare favorably to previously published results. Temporary storage is not used because it tends 
to be expensive in physical implementations. For each gate library, best gate counts can be achieved by a 
single universal circuit. To compute gate parameters in universal circuits, we only use closed-form algebraic 
expressions, and in particular do not rely on matrix exponentials. Our algorithm has been coded in C++. 



PACS numbers: 03.67.Lx, 03.65.Fd, 03.65.Ud 

I. INTRODUCTION 

Recent empirical work on quantum communication, cryp- 
tography and computation Ql resulted in a number of exper- 
imental systems that can implement two-qubit circuits. Thus, 
decomposing arbitrary two-qubit operators into fewer gates 
from a universal library may simplify such physical implementations.
While the universality of various gate libraries has been established
in the past [3], the minimization of gate counts has only been studied
recently. Universal quantum circuits with six, four and three CNOT
gates have been found that can simulate an arbitrary two-qubit
operator up to phase. It has also been shown that if the CNOT gate is
the only two-qubit gate available, then three CNOT gates are required
laH^]. Many of these results rely on the Makhlin invariants [9] or the
related magic basis and canonical decomposition [10, 11, 12, 13].
Similar invariants have been investigated previously [14, 15] and more
recently in IT^.

Our work improves or broadens each of the above circuit
constructions and lower bounds, as summarized in Table I.
We rely on the Makhlin invariants [9], and simplify them for
mathematical and computational convenience; our version
facilitates circuit synthesis algorithms. We have coded the
computation of specific gate parameters in several hundred 
lines of C++, and note that it involves only closed-form al- 
gebraic expressions in the matrix elements of the original op- 
erator (no matrix logarithms or exponents) . We articulate the 
degrees of freedom in our algorithm, and our program produces
multiple circuits for the same operator. This may be useful with
particular implementation technologies where certain gate sequences
are more likely to experience errors. Additionally, this paper
contributes a lower bound for the number of CNOT gates required to
simulate an arbitrary n-qubit operator, which is tighter than the
generic bound for arbitrary two-qubit operators T^.TtIi.

The two lines in Table I give gate counts for circuits consisting
of elementary and basic gates, respectively. Both types were
introduced in [3], but basic gates better reflect gate costs in some
physical implementations where all one-qubit gates are equally
accessible. Yet, when working with ion traps, $R_z$ gates are
significantly easier to implement than $R_x$ and $R_y$ gates ifl^.
Our work uncovers another asymmetry, which is of theoretical nature
and does not depend on the implementation technology: a subtle
complication arises when only CNOT, $R_x$ and $R_z$ gates are
available.

Gate library                                  Lower bounds     Upper bounds
                                              CNOT  overall    CNOT  overall
{CNOT, any 2 or 3 of {$R_x$, $R_y$, $R_z$}}     3     18         3     18
{CNOT, arbitrary 1-qubit gates}                 3      9         3     10

TABLE I: Constructive upper bounds on gate counts for generic
circuits using several gate libraries. Each bound given for
controlled-not (CNOT) gates is compatible with the respective overall
bound. These bounds are tighter than those from |^|^ in all relevant
cases.

Our work shows that basic-gate circuits can be simplified by
temporarily decomposing basic gates into elementary gates, so as to
apply convenient circuit identities summarized in Table II. Indeed,
all lower bounds in Table I and the n-qubit CNOT bound above rely on
these circuit identities. Additionally, temporary decompositions into
elementary gates may help optimize pulse sequences in physical
implementations.

The remainder of this paper is structured as follows. Section II
discusses gate libraries and circuit topologies. Section III derives
the lower bounds of Table I. Section IV classifies two-qubit operators
up to local unitaries. Section V develops some technical lemmata, and
Section VI constructs small circuits that match the upper bounds in
Table I. Subtle complications caused by the lack of the $R_y$ gate are
discussed in the Appendix and Section VII.



II. GATE LIBRARIES AND CIRCUIT TOPOLOGIES 

We recall that the Bloch sphere isomorphism [3] identifies a unit
vector $n = (n_x, n_y, n_z)$ with
$\sigma_n = n_x \sigma_x + n_y \sigma_y + n_z \sigma_z$. Under this
identification, rotation by the angle $\theta$ around the vector $n$
corresponds to the special unitary operator
$R_n(\theta) = e^{-i \sigma_n \theta / 2}$. It is from this
identification that the decomposition of an arbitrary one-qubit gate
$U = e^{i\Phi} R_z(\theta) R_y(\phi) R_z(\psi)$ arises. Of course, the
choice of y, z is arbitrary; one may take any pair of orthogonal
vectors in place of y, z.

Lemma II.1 Let $n, m \in \mathbb{R}^3$, $n \perp m$, and
$U \in SU(2)$. Then one can find $\theta$, $\phi$, and $\psi$ such
that $U = R_n(\theta) R_m(\phi) R_n(\psi)$.

In the case of $n \perp m$, we have
$\sigma_n R_m(\theta) \sigma_n = R_m(-\theta)$ and
$R_n(\pi/2) R_m(\phi) R_n(-\pi/2) = R_p(\phi)$ for $p = n \times m$
(with $R_n(\theta) = e^{-i \sigma_n \theta / 2}$ as above, conjugating
$R_x$ by $R_z(\pi/2)$ gives $R_y$). For convenience, we set
$S_n = R_n(\pi/2)$; then $S_z$ is the usual S gate, up to phase. In
the sequel, we always take m, n out of x, y, z.
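
The decomposition of Lemma II.1 can be checked numerically for the axes $n = z$, $m = y$. The following is a minimal sketch, not the authors' C++ program; the helper names (`matmul`, `rx`, `ry`, `rz`, `zyz_angles`) are introduced here for illustration only. It recovers the three angles from the entries of a matrix in $SU(2)$, rebuilds the matrix, and also checks the two conjugation rules quoted above.

```python
import cmath
import math

def matmul(a, b):
    """Multiply two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def rx(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -1j * s], [-1j * s, c]]

def ry(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -s], [s, c]]

def rz(t):
    return [[cmath.exp(-0.5j * t), 0], [0, cmath.exp(0.5j * t)]]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def zyz_angles(u):
    """Return (theta, phi, psi) with u = rz(theta) ry(phi) rz(psi), u in SU(2)."""
    phi = 2 * math.atan2(abs(u[1][0]), abs(u[0][0]))
    plus = cmath.phase(u[1][1])    # equals (theta + psi) / 2
    minus = cmath.phase(u[1][0])   # equals (theta - psi) / 2
    return plus + minus, phi, plus - minus

# An arbitrary element of SU(2), built from two rotations.
u = matmul(rx(0.7), ry(1.1))
theta, phi, psi = zyz_angles(u)
rebuilt = matmul(rz(theta), matmul(ry(phi), rz(psi)))

# Conjugation rules: sigma_x flips the sign of an R_z angle,
# and S_z = R_z(pi/2) turns an R_x rotation into an R_y rotation.
sigma_x = [[0, 1], [1, 0]]
flipped = matmul(sigma_x, matmul(rz(0.4), sigma_x))
turned = matmul(rz(math.pi / 2), matmul(rx(0.3), rz(-math.pi / 2)))
```

Only closed-form expressions in the matrix entries are used (absolute values and arguments), which mirrors the paper's remark that no matrix logarithms are needed.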

We denote by $C_a^b$ the controlled-not (CNOT) gate with control on
the a-th qubit and target on the b-th. We recall that $R_z$ gates
commute past CNOTs on the control line and $R_x$ gates commute past
CNOTs on the target. Finally, for mathematical convenience, we
multiply the CNOT gate by a global phase $\xi$ such that
$\xi^4 = -1$, to represent it as an element of $SU(4)$.
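
These commutation rules and the phase normalization are easy to confirm on explicit matrices. The sketch below is an illustration under stated assumptions: qubit 1 is the most significant bit of the basis index, `CNOT12` denotes the CNOT with control on qubit 1 and target on qubit 2, and all helper names are hypothetical.

```python
import cmath
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    """Tensor product of two matrices given as nested lists."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def det(m):
    """Determinant by Laplace expansion along the first row (fine for 4x4)."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def rx(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -1j * s], [-1j * s, c]]

def rz(t):
    return [[cmath.exp(-0.5j * t), 0], [0, cmath.exp(0.5j * t)]]

I2 = [[1, 0], [0, 1]]
CNOT12 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]

# R_z on the control wire and R_x on the target wire commute with the CNOT.
rz_on_control = kron(rz(0.9), I2)
rx_on_target = kron(I2, rx(0.9))
rx_on_control = kron(rx(0.9), I2)
commutes_rz = close(matmul(rz_on_control, CNOT12), matmul(CNOT12, rz_on_control))
commutes_rx = close(matmul(rx_on_target, CNOT12), matmul(CNOT12, rx_on_target))
rx_control_commutes = close(matmul(rx_on_control, CNOT12), matmul(CNOT12, rx_on_control))

# The plain CNOT has determinant -1; a phase xi with xi**4 = -1 fixes that.
xi = cmath.exp(1j * math.pi / 4)
phased = [[xi * x for x in row] for row in CNOT12]
det_plain = det(CNOT12)
det_phased = det(phased)
```

The last flag, `rx_control_commutes`, is expected to be false: an $R_x$ gate on the control wire does not pass through the CNOT.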

In this work we distinguish two types of gate libraries for quantum
operators that are universal in the exact sense (compare to
approximate synthesis and the Solovay-Kitaev theorem). The basic-gate
library [3] contains the CNOT and all one-qubit gates.
Elementary-gate libraries also contain the CNOT gate and one-qubit
gates, but we additionally require that they contain only finitely
many one-parameter subgroups of $SU(2)$. Lemma II.1 indicates that if
such a library includes two one-parameter subgroups of $SU(2)$
(rotations around orthogonal axes) then the library is universal. In
the literature, it is common to make assertions like
$\dim[SU(2^n)] = 4^n - 1$. Thus if a given gate library contains only
gates from one-parameter families and fully-specified gates such as
CNOT, at least $4^n - 1$ one-parameter gates are necessary
jsl, ITtI Theorem 3.4]. Such dimension-counting arguments lower-bound
the number of $R_x$, $R_y$, $R_z$ gates required in the worst case
js]].

To formalize dimension-counting arguments, we introduce 
the concept of circuit topologies — underspecified circuits 
that may have placeholders instead of some gates, only with 
the gate type specified. Before studying a circuit topology, 
we must fix a gate library and thus restrict the types of fully- 
specified (constant) gates and placeholders. We say that a 
fully-specified circuit C conforms to a circuit topology T if C 
can be obtained from T by specifying values for the variable 
gates. All k-qubit gates are to be in $SU(2^k)$, i.e., normalized.
For an n-qubit circuit topology $T$, we define $Q(T) \subseteq SU(2^n)$
to be the set of all operators that can be simulated, up to global
phase, by circuits conforming to $T$. We say that $T$ is universal
iff $Q(T) = SU(2^n)$. In this work, constant gates are CNOTs,
and placeholders represent either all one-qubit gates or a given
one-parameter subgroup of $SU(2)$. We label one-qubit gate
placeholders by a, b, c, ..., and one-parameter placeholders by
$R$ with subscripts x, y or z.

We also allow for explicit relations between placeholders. For
example, circuits conforming to the one-qubit circuit topology
$a b a^\dagger$ must contain three one-qubit gates and the first
and last must be inverse to each other.

Circuit identities such as
$R_n(\theta) R_n(\phi) = R_n(\theta + \phi)$ can be performed at the
level of circuit topologies. This identity indicates that two $R_n$
gates may always be combined into one $R_n$ gate, hence anywhere we
find two consecutive $R_n$ placeholders in a circuit topology $T$, we
may replace them with a single one without shrinking $Q(T)$. Of
course, $Q(T)$ does not grow, either, since
$R_n(\psi) = R_n(0) R_n(\psi)$. We may similarly conglomerate
arbitrary one-qubit gate placeholders, pass $R_z$ ($R_x$)
placeholders through the control (target) of CNOT gates, decompose
arbitrary one-qubit gate placeholders into $R_n R_m R_n$ placeholders
for $n \perp m$, etc.

We now formalize the intuition that the dimension of $SU(2^n)$
should match the number of one-parameter gates.

Lemma II.2 Fix a gate library consisting of constant gates and
finitely many one-parameter subgroups. Then almost all n-qubit
operators cannot be simulated by a circuit with fewer than
$4^n - 1$ gates from the one-parameter subgroups.

Proof: Fix a circuit topology $T$ with $\ell < 4^n - 1$
one-parameter placeholders. Observe that matrix multiplication and
tensor product are infinitely differentiable mappings and let
$f : \mathbb{R}^\ell \to SU(2^n)$ be the smooth function that
evaluates the operator simulated by $T$ for specific values of
parameters in placeholders. Accounting for global phase,
$Q(T) = \bigcup_{\xi^{2^n} = 1} \xi \, \mathrm{Image}(f)$. Sard's
theorem [19, p.39] demands that $\mathrm{Image}(f)$ be a measure-zero
subset of $SU(2^n)$ for dimension reasons, and a finite union of
measure-zero sets is measure-zero.

For a given library, there are only countably many circuit
topologies. Each captures a measure-zero set of operators,
and their union is also a measure-zero set. □



III. LOWER BOUNDS 

Lemma II.2 implies that for any given elementary-gate library, one
can find n-qubit operators requiring at least $4^n - 1$ one-qubit
gates. We use this fact to obtain a lower bound for the number of
CNOT gates required.

Proposition III.1 Fix any gate library containing only the CNOT and
one-qubit gates. Then almost all n-qubit operators cannot be
simulated by a circuit with fewer than
$\lceil \frac{1}{4}(4^n - 3n - 1) \rceil$ CNOT gates.

Proof: Enlarging the gate library cannot increase the minimum number
of CNOTs in a universal circuit. Thus we may assume the library is the
basic-gate library. We show that any n-qubit circuit topology $T$
with $k$ CNOT gates can always be replaced with an n-qubit circuit
topology $T'$ with gates from the {$R_z$, $R_x$, CNOT} gate library
such that $Q(T) = Q(T')$ and $T'$ has $k$ CNOTs and at most $3n + 4k$
one-parameter gates. The proposition follows from
$3n + 4k \ge 4^n - 1$.

We begin by conglomerating neighboring one-qubit gates; this leaves
at most $n + 2k$ one-qubit gates in the circuit. Now observe that the
following three circuit topologies parametrise the same sets of
operators:

$C_2^1 (a \otimes b) = C_2^1 (R_x R_z R_x \otimes R_z R_x R_z) = (R_x \otimes R_z) C_2^1 (R_z R_x \otimes R_x R_z)$

We use this identity iteratively, starting at the left of the circuit
topology. This ensures that each CNOT has exactly four one-parameter
gates to its left. (Note that we apply gates in circuits left to
right, but read formulae for the same circuits right to left.) The n
one-qubit gates at the far right of the circuit can be decomposed
into three one-parameter gates apiece. □
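
The counting in this proof reduces to the inequality $3n + 4k \ge 4^n - 1$, so the bound of Proposition III.1 is simple to tabulate. The following sketch (the function name `cnot_lower_bound` is introduced here only for illustration) evaluates it with exact integer arithmetic for small n.

```python
def cnot_lower_bound(n):
    """Smallest k with 3n + 4k >= 4**n - 1, i.e. ceil((4**n - 3n - 1) / 4)."""
    return -(-(4 ** n - 3 * n - 1) // 4)

# Bound for one to five qubits.
bounds = {n: cnot_lower_bound(n) for n in range(1, 6)}

# Cross-check against the inequality used in the proof.
consistent = all(
    3 * n + 4 * k >= 4 ** n - 1 and 3 * n + 4 * (k - 1) < 4 ** n - 1
    for n, k in bounds.items()
)
```

For n = 2 this gives three CNOT gates, which is the two-qubit case treated in the rest of the section.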

Corollary III.2 Fix an elementary-gate library. Then almost 
all two-qubit operators cannot be simulated without at least 
three CNOT gates and fifteen one-qubit gates. 

For elementary-gate libraries containing two out of the three
subgroups $R_x$, $R_y$, $R_z$, we give explicit universal two-qubit
circuit topologies matching this bound in Section VI.

Proposition III.3 Using the basic-gate library, almost all
two-qubit operators require at least three CNOT gates, and
at least nine basic gates total.



Proof: Proposition III.1 implies that at least three CNOT gates 
are necessary in general; at least five one-qubit placehold- 
ers are required for dimension reasons. The resulting over- 
all lower bound of eight basic gates can be improved further 
by observing that given any placement of five one-qubit gates 
around three CNOTs, one can find two one-qubit gates on the 
same wire, separated only by a CNOT. Using the $R_z R_x R_z$ or 
$R_x R_z R_x$ decomposition as necessary, the five one-qubit gates can 
be replaced by fifteen one-parameter gates in such a way that 
the closest parameterized gates arising from the adjacent one- 
qubit gates can be combined. Thus, if five one-qubit place- 
holders and three CNOTs suffice, then so do fourteen one- 
parameter placeholders and three CNOTs, which contradicts 
dimension-based lower bounds. □ 



IV. INVARIANTS OF TWO-QUBIT OPERATORS 

To study two-qubit operators that differ only by pre- or
post-composing with one-qubit operators, we use the terminology of
cosets, common in abstract algebra [20]. Let G be the group of
operators that can be simulated entirely by one-qubit operations.
That is,
$G = SU(2)^{\otimes n} = \{a_1 \otimes a_2 \otimes \cdots \otimes a_n : a_i \in SU(2)\}$.
Then two operators u, v are said to be in the same left coset of
$SU(4)$ modulo G (written: $uG = vG$) iff u differs from v only by
pre-composing with one-qubit operators; that is, if $u = vg$ for some
$g \in G$. Similarly, we say that u and v are in the same right coset
($Gu = Gv$) if they differ only by post-composition ($u = hv$ for some
$h \in G$), and we say that u and v are in the same double coset
($u \in GvG$) if they differ by possibly both pre- and
post-composition ($u = hvg$ for some $g, h \in G$). In the literature,
the double cosets are often referred to as local equivalence classes
[14].

Polynomial invariants classifying the double cosets have been
proposed by Makhlin [9]. In what follows, we present equivalent
invariants which generalize to n qubits and are more straightforward
to compute. Moreover, the proofs given here detail an explicit
constructive procedure to find a, b, c, d such that
$(a \otimes b) u (c \otimes d) = v$, once it has been determined by
computing invariants that u, v are in the same double coset.
Definition IV.1 We define $\gamma_n$ on $2^n \times 2^n$ matrices by
the formula $u \mapsto u \sigma_y^{\otimes n} u^T \sigma_y^{\otimes n}$.
When n is arbitrary or clear from context, we write $\gamma$ for
$\gamma_n$.

Proposition IV.2 $\gamma$ has the following properties:

1. $\gamma(I) = I$

2. $\gamma(ab) = a \gamma(b) \gamma(a^T)^T a^{-1}$

3. $\gamma(a \otimes b) = \gamma(a) \otimes \gamma(b)$

4. $g \in M_{2 \times 2} \Rightarrow \gamma(g) = \det(g) \cdot I$

5. $\gamma$ is constant on the left cosets $u \cdot SU(2)^{\otimes n}$

6. $\chi[\gamma]$ is constant on double cosets $SU(2)^{\otimes n} \cdot u \cdot SU(2)^{\otimes n}$

Proof: (1), (2), and (3) are immediate from the definition. (4)
can be checked explicitly for n = 1, and then the general case
follows from (3). For (5), note first that
$g \in SU(2)^{\otimes n} \Rightarrow \gamma(g) = I$ by (4). Then
expressing $\gamma(ag)$ and $\gamma(a \cdot I)$ using (1) and (2), we
see they are equal. For (6), we use (2), (4), and (5) to see that
$g, h \in SU(2)^{\otimes n} \Rightarrow \gamma(gah) = g \gamma(ah) g^{-1} = g \gamma(a) g^{-1}$,
thus $\chi[\gamma(gah)] = \chi[\gamma(a)]$. Incidentally, (6) is
closely related to (H Thm 1.3]. □

While $\gamma$ is constant on left cosets and $\chi[\gamma]$ on
double cosets, these invariants do not in general suffice to classify
cosets. Roughly, a parameter space for double cosets would need
dimension $\dim(SU(2^n)) - 2\dim(SU(2)^{\otimes n}) = 4^n - 6n - 1$,
whereas the space of possible $\chi[\gamma]$ has dimension $2^n - 1$
(because the $2^n$ roots of $\chi[\gamma]$ must all have unit length
and have unit product). The first dimension is much larger except for
n = 1, 2. In the case n = 1, there is only one left coset (and only
one double coset), so our invariants trivially suffice. For n = 2,
these numbers come out exactly equal, and $\gamma$ and $\chi[\gamma]$
serve to classify respectively the left cosets and double cosets.

Proposition IV.3 For $u, v \in SU(4)$, $G = SU(2) \otimes SU(2)$:

1. $u \in G \iff \gamma(u) = I$

2. $uG = vG \iff \gamma(u) = \gamma(v)$

3. $GuG = GvG \iff \chi[\gamma(u)] = \chi[\gamma(v)]$

Proof: Recall that $E \in U(4)$ can be found such that
$E \, SO(4) \, E^\dagger = G$; such matrices are characterized by the
property that $E E^T = -\sigma_y \otimes \sigma_y$. This and related
issues have been exhaustively dealt with in several papers
[10, 11, 12, 13, 16], where it is shown that E can be chosen as:

$$E = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & i & 0 & 0 \\ 0 & 0 & i & 1 \\ 0 & 0 & i & -1 \\ 1 & -i & 0 & 0 \end{pmatrix}$$

Observe that the properties $\gamma(u) = I$, $\gamma(u) = \gamma(v)$,
$\chi[\gamma(u)] = \chi[\gamma(v)]$ are not changed by replacing
$\gamma$ with $E^\dagger \gamma E$. Then using the fact
$-\sigma_y \otimes \sigma_y = E E^T = (E E^T)^\dagger$, compute:

$$E^\dagger \gamma(g) E = E^\dagger g E E^T g^T \bar{E} E^\dagger E = (E^\dagger g E)(E^\dagger g E)^T$$

Therefore it suffices to prove the proposition after making the
following substitutions: $g \mapsto u = E^\dagger g E$,
$G \mapsto SO(4)$, $\gamma(g) \mapsto u u^T$. Now (1) is immediate and
(2) follows from
$u u^T = v v^T \iff v^\dagger u = \overline{v^\dagger u} \iff v^\dagger u \in SO(4)$.

To prove (3), note that for P symmetric unitary, $P^{-1} = \bar{P}$,
hence $[P + \bar{P}, P - \bar{P}] = 0$. It follows that the real and
imaginary parts of P share an orthonormal basis of eigenvectors.
As they are moreover real symmetric matrices, we know from the
spectral theorem that their eigenvectors can be taken to be real.
Thus one can find an $a \in SO(4)$ such that $a u u^T a^T$ is
diagonal. By re-ordering (and negating) the columns of a, we can
re-order the diagonal elements of $a u u^T a^T$ as desired. Thus if
$\chi[u u^T] = \chi[v v^T]$, we can find $a, b \in SO(4)$ such that
$a u u^T a^T = b v v^T b^T$ by diagonalizing both; then
$(v^\dagger b^T a u)(v^\dagger b^T a u)^T = I$. Let
$c = v^\dagger b^T a u \in SO(4)$. We have $a^T b v c = u$, as
desired. □

The proof above gives an algorithm for computing a, b, c, d for
given two-qubit u and v so that $(a \otimes b) u (c \otimes d) = v$.
Also, u may be chosen as a relative-phasing of Bell states.
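
The invariants of this section can be exercised numerically. The sketch below is illustrative only: it assumes $\gamma(u) = u (\sigma_y \otimes \sigma_y) u^T (\sigma_y \otimes \sigma_y)$ for two qubits, computes characteristic polynomials with the Faddeev-LeVerrier recursion (a technique chosen here for convenience, not taken from the paper), and uses one common form of the magic-basis matrix as an assumed choice of E. All function and variable names are hypothetical.

```python
import cmath
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def transpose(a):
    return [list(col) for col in zip(*a)]

def dagger(a):
    return [[x.conjugate() for x in col] for col in zip(*a)]

def close(a, b, tol=1e-9):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def rx(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -1j * s], [-1j * s, c]]

def ry(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -s], [s, c]]

def rz(t):
    return [[cmath.exp(-0.5j * t), 0], [0, cmath.exp(0.5j * t)]]

SY = [[0, -1j], [1j, 0]]
YY = kron(SY, SY)
I4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

def gamma(u):
    """gamma(u) = u (sy x sy) u^T (sy x sy) for a 4x4 matrix u."""
    return matmul(matmul(u, YY), matmul(transpose(u), YY))

def charpoly(a):
    """Coefficients [1, c3, c2, c1, c0] of det(X - a) via Faddeev-LeVerrier."""
    n = len(a)
    m = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    coeffs = [1]
    for k in range(1, n + 1):
        am = matmul(a, m)
        c = -sum(am[i][i] for i in range(n)) / k
        coeffs.append(c)
        m = [[am[i][j] + (c if i == j else 0) for j in range(n)] for i in range(n)]
    return coeffs

XI = cmath.exp(1j * math.pi / 4)          # phase making the CNOT special unitary
CNOT = [[XI, 0, 0, 0], [0, XI, 0, 0], [0, 0, 0, XI], [0, 0, XI, 0]]
u = matmul(CNOT, kron(ry(0.6), rx(1.7)))  # an entangling two-qubit operator
g = kron(matmul(rz(0.3), ry(1.2)), rx(0.5))   # local operators in SU(2) x SU(2)
h = kron(ry(0.9), matmul(rx(2.0), rz(0.4)))

local_is_trivial = close(gamma(g), I4)                      # Prop. IV.3 (1)
left_coset_invariant = close(gamma(matmul(u, g)), gamma(u)) # Prop. IV.3 (2)
p_hug = charpoly(gamma(matmul(h, matmul(u, g))))
p_u = charpoly(gamma(u))
double_coset_invariant = all(abs(x - y) < 1e-9 for x, y in zip(p_hug, p_u))

s = 1 / math.sqrt(2)
E = [[s, 1j * s, 0, 0], [0, 0, 1j * s, s], [0, 0, 1j * s, -s], [s, -1j * s, 0, 0]]
magic_property = close(matmul(E, transpose(E)), [[-x for x in row] for row in YY])
in_magic_basis = matmul(dagger(E), matmul(g, E))
local_is_real = all(abs(complex(x).imag) < 1e-9 for row in in_magic_basis for x in row)
```

The final check reflects the role of E in the proof: in the magic basis a local operator becomes a real orthogonal matrix.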

V. TECHNICAL LEMMATA

We present two parameterizations of the space of double cosets
described in Section IV. These will be used in the constructions of
universal two-qubit circuit topologies to follow.

We will use the following general technique to compute $\gamma(u)$.
First, determine a circuit, C, simulating the operator u. Given C, it
is straightforward to obtain a circuit simulating
$\sigma_y^{\otimes 2} u^T \sigma_y^{\otimes 2}$: reverse the order of
gates in C, and replace a given gate g by
$\sigma_y^{\otimes 2} g^T \sigma_y^{\otimes 2}$. As will be shown
below, if g is a one-qubit gate, then
$\sigma_y g^T \sigma_y = g^\dagger$. For the CNOT, we note that
$\sigma_y^{\otimes 2} C_2^1 \sigma_y^{\otimes 2} = C_2^1 (\sigma_x \otimes \sigma_z)$
and similarly
$\sigma_y^{\otimes 2} C_1^2 \sigma_y^{\otimes 2} = C_1^2 (\sigma_z \otimes \sigma_x)$,
both up to a global sign. Now, combine the circuits for u and
$\sigma_y^{\otimes 2} u^T \sigma_y^{\otimes 2}$ to obtain a circuit
simulating $\gamma(u)$.

FIG. 1: Circuit identities to move $\sigma_x$, $\sigma_z$ past CNOT.
The $\sigma_x$ identity is standard in the theory of classical
reversible circuits, where $\sigma_x$ is just the NOT gate, and
amounts to the statement that
$(1 \oplus a) \oplus (1 \oplus b) = (a \oplus b)$. The $\sigma_z$
identity can be obtained from it by conjugating by $H \otimes H$.

Proposition V.1 For any $u \in SU(4)$, one can find $\alpha$,
$\beta$, $\delta$ such that
$\chi[\gamma(u)] = \chi[\gamma(C_2^1 (I \otimes R_y(\alpha)) C_1^2 (R_z(\delta) \otimes R_y(\beta)) C_2^1)]$.

Proof: Let
$v = C_2^1 (I \otimes R_y(\alpha)) C_1^2 (R_z(\delta) \otimes R_y(\beta)) C_2^1$.
As v is given explicitly by a circuit, we use the technique described
above to determine the following circuit for $\gamma(v)$.

[Circuit diagram for $\gamma(v)$.]

Here, $R'_y = R_y(\alpha)$, $R_y = R_y(\beta)$, and
$R_z = R_z(\delta)$. We now use the circuit identities in Figure 1
and $\sigma_i R_j(\theta) = R_j(-\theta) \sigma_i$ to push all the
$\sigma_i$ gates to the left of the circuit, where they cancel up to
an irrelevant global phase of -1. All gates in the wake of their
passing become inverted, and we obtain the following circuit.

[Circuit diagram after pushing the $\sigma_i$ gates to the left.]

For invertible matrices, $\chi(AB) = \chi(A^{-1}(AB)A) = \chi(BA)$.
In view of the fact that we are ultimately interested only in
$\chi[\gamma(v)]$ we may move gates from the left of the circuit to
the right. Thusly conglomerating $R_y$ gates and canceling paired
CNOT gates, we obtain:

[Circuit diagram after conglomerating gates.]

We have shown
$\chi[\gamma(v)] = \chi[C_1^2 (R_z(\delta) \otimes R_y(\beta)) C_1^2 (I \otimes R_y(\alpha))]$.
Again, since $\chi[B] = \chi[A^{-1} B A]$, we conjugate by
$I \otimes S_x$. This fixes the CNOT gate and replaces $R_y$ gates
with $R_z$:

$\chi[\gamma(v)] = \chi[C_1^2 (R_z(\delta) \otimes R_z(\beta)) C_1^2 (I \otimes R_z(\alpha))]$

Finally, we ensure that the entries of the diagonal matrix
$C_1^2 (R_z(\delta) \otimes R_z(\beta)) C_1^2 (I \otimes R_z(\alpha))$
match the spectrum of $\gamma(U)$ by specifying $\alpha$, $\beta$ and
$\delta$ in terms of x, y, z, for $e^{ix}$, $e^{iy}$, $e^{iz}$ any
three eigenvalues of $\gamma(U)$. □

Proposition V.2 For any $u \in SU(4)$, one can find $\theta$,
$\phi$, $\psi$ such that
$\chi[\gamma(u C_1^2 (I \otimes R_z(\psi)) C_1^2)] = \chi[\gamma(C_1^2 (R_x(\theta) \otimes R_z(\phi)) C_1^2)]$.

Proof: We set $\Delta = C_1^2 (I \otimes R_z(\psi)) C_1^2$ and
compute $\mathrm{tr}[\gamma(u\Delta)]$. By Proposition IV.2 this is
$\mathrm{tr}[\gamma(u^T)^T \gamma(\Delta)]$. Explicit computation as
in the previous proposition gives $\gamma(\Delta) = \Delta^2$, and one
obtains
$\mathrm{tr}[\gamma(u\Delta)] = (t_1 + t_4) e^{-i\psi} + (t_2 + t_3) e^{i\psi}$,
where $t_1, t_2, t_3, t_4$ are the diagonal entries of
$\gamma(u^T)^T$. We may ensure that this number is real by requiring
$\tan(\psi) = \mathrm{Im}(t_1 + t_2 + t_3 + t_4) / \mathrm{Re}(t_1 + t_4 - t_2 - t_3)$.

Now consider $m \in SU(N)$, $\chi[m] = \sum_i a_i X^i = \prod_i (X - r_i)$,
where the $r_i$ form the spectrum of m. Since $m \in SU(N)$, we must
have $\prod_i r_i = 1$ and $\bar{r}_i = 1/r_i$. Therefore,
$\chi[m] = \chi[m] \prod_i \bar{r}_i = \prod_i (\bar{r}_i X - 1)$.
Expanding the equality $\prod_i (X - r_i) = \prod_i (\bar{r}_i X - 1)$
gives $a_i = \bar{a}_{N-i}$ for even N. In particular, for N = 4,
$a_2 \in \mathbb{R}$, and $\mathrm{tr}(m) = -a_3 = -\bar{a}_1$. Since
$a_4 = a_0 = 1$, $\chi[m]$ has all real coefficients iff
$\mathrm{tr}[m] \in \mathbb{R}$. In this case, the roots of $\chi[m]$
must come in conjugate pairs:
$\chi[m] = (X - e^{ir})(X - e^{-ir})(X - e^{is})(X - e^{-is})$.
On the other hand, for
$w = C_1^2 (R_x(\theta) \otimes R_z(\phi)) C_1^2$, one can verify
that $\chi[\gamma(w)]$ takes this form.

Taking $m = \gamma(U C_1^2 (I \otimes R_z(\psi)) C_1^2)$, with $\psi$
as determined above, we obtain $\theta = \frac{r + s}{2}$,
$\phi = \frac{r - s}{2}$. □



VI. MINIMAL TWO-QUBIT CIRCUITS

We now construct universal two-qubit circuit topologies that match
the upper bounds of Table I. We consider three different gate
libraries: each contains the CNOT, and two out of the three
one-parameter gates {$R_x$, $R_y$, $R_z$}. We will refer to these as
the CXY, CYZ, and CXZ gate libraries.

In view of Lemma II.1 one might think that there is no significant
distinction between these cases. Indeed, conjugation by the Hadamard
gate will allow us to move easily between the CXY and CYZ gate
libraries. However, we will see that the CXZ gate library is
fundamentally different from the other two. Roughly, the reason is
that $R_x$ and $R_z$ can be respectively moved past the target and
control of the CNOT gate, while no such identity holds for the $R_y$
gate. While the CXY and CYZ libraries each only contain one of
{$R_x$, $R_z$}, the CXZ gate library contains both, and consequently
has different characteristics. Nonetheless, gate counts will be the
same in all cases. We begin with the CYZ case, which has been
previously considered in [5].

FIG. 2: A universal two-qubit circuit with three CNOT gates. It
requires 10 basic gates [3] or 18 gates from {CNOT, $R_y$, $R_z$}.

Theorem VI.1 Fifteen {$R_y$, $R_z$} gates and three CNOTs suffice
to simulate an arbitrary two-qubit operator.

Proof: Choose $\alpha$, $\beta$, $\delta$ as in Proposition V.1. Then
by Proposition IV.3 one can find $a, b, c, d \in SU(2)$ such that

$U = (a \otimes b) C_2^1 (I \otimes R_y(\alpha)) C_1^2 (R_z(\delta) \otimes R_y(\beta)) C_2^1 (c \otimes d)$

Thus, the circuit topology depicted in Figure 2 is universal. □

Theorem VI.2 Fifteen {$R_x$, $R_y$} gates and three CNOTs suffice
to simulate an arbitrary two-qubit operator.

Proof: Conjugation by $H^{\otimes 2}$ fixes $SU(2)^{\otimes 2}$ and
the $R_y$ subgroup. It also flips CNOT gates
($H^{\otimes 2} C_1^2 H^{\otimes 2} = C_2^1$) and swaps $R_x$ with
$R_z$. □
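
The effect of Hadamard conjugation used in this proof can be confirmed on explicit matrices. This is a small sketch under the assumption that qubit 1 is the most significant bit of the basis index; `CNOT12` (control on qubit 1) and `CNOT21` (control on qubit 2) are names chosen here for illustration.

```python
import cmath
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def rx(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -1j * s], [-1j * s, c]]

def ry(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -s], [s, c]]

def rz(t):
    return [[cmath.exp(-0.5j * t), 0], [0, cmath.exp(0.5j * t)]]

r = 1 / math.sqrt(2)
H = [[r, r], [r, -r]]
HH = kron(H, H)
CNOT12 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]
CNOT21 = [[1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0], [0, 1, 0, 0]]

def conj_h(g):
    """Conjugate a one-qubit gate by the Hadamard gate (H is its own inverse)."""
    return matmul(H, matmul(g, H))

flips_cnot = close(matmul(HH, matmul(CNOT12, HH)), CNOT21)
swaps_x_to_z = close(conj_h(rx(0.8)), rz(0.8))
swaps_z_to_x = close(conj_h(rz(0.8)), rx(0.8))
keeps_y_family = close(conj_h(ry(0.8)), ry(-0.8))
```

Note that $R_y(\theta)$ is sent to $R_y(-\theta)$, so the $R_y$ subgroup is preserved as a set even though individual angles change sign.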
Unfortunately, no such trick transforms CYZ into CXZ.
Any such transformation would yield a universal two-qubit
circuit topology in the CXZ library in which only three
one-parameter gates occur in the middle. We show in the Appendix
that no such circuit can be universal and articulate the
implications of this distinction in Section VII. Nonetheless,
we demonstrate here a universal two-qubit circuit topology
with gates from the {$R_x$, $R_z$, CNOT} gate library that contains
15 one-qubit gates and 3 CNOT gates.

Theorem VI.3 Fifteen {$R_x$, $R_z$} gates and three CNOTs suffice
to simulate an arbitrary two-qubit operator.

Proof: Let $U'$ be the desired operator; set $U = U' C_1^2$. Choose
$\theta$, $\phi$, $\psi$ for $U$ as in Proposition V.2. By
Proposition IV.3 one can find $a, b, c, d \in SU(2)$ such that

$U' (I \otimes R_z(\psi)) C_1^2 = (a \otimes b) C_1^2 (R_x(\theta) \otimes R_z(\phi)) C_1^2 (c \otimes d)$

Solving for $U'$ gives the overall circuit topology in Figure 3. □

FIG. 3: Another universal two-qubit circuit with three CNOT gates.
It requires 10 basic gates [3] or 18 gates from {CNOT, $R_x$, $R_z$}.

FIG. 4: The result of our algorithm applied to the two-qubit
Quantum Fourier Transform. The circuit contains 3 one-qubit gates and
3 CNOTs, but the one-qubit gates are broken up into elementary gates
for specificity. Here, $T_z = R_z(\pi/4)$ is the T gate defined in
jl|l up to a global phase.

Unlike the circuit of Theorem VI.1, the circuit in Figure 3 can be
adapted to both other gate libraries. We can replace c by
$S_z (S_z^\dagger c)$ and a by $(a S_z) S_z^\dagger$, then use the
$S_z$, $S_z^\dagger$ gates to change the $R_x$ gate into an $R_y$. A
similar trick using $S_x$ can change the bottom $R_z$ gates into
$R_y$; this yields a circuit in the CYZ gate library. As in
Theorem VI.2, conjugating by $H \otimes H$ yields a circuit in the
CXY gate library.

Given an arbitrary two-qubit operator, individual gates in
universal circuits can be computed by interpreting proofs of
Propositions IV.3, V.1 and V.2, and Theorems VI.1, VI.2 and
VI.3 as algorithms. By re-ordering eigenvalues in the proof
of Proposition IV.3 one may typically produce several different
circuits. Similar degrees of freedom are discussed in jsj].

To complete Table I, count basic gates in Figure 2 or 3.



VII. CONCLUSIONS 

Two-qubit circuit synthesis is relevant to on-going physics 
experiments and can be used in peephole optimization of 
larger circuits, where small sub-circuits are identified and sim- 
plified one at a time. This is particularly relevant to quantum 
communication, where protocols often transmit one qubit at a 
time and use encoding/decoding circuits on three qubits. 

We constructively synthesize small circuits for arbitrary
two-qubit operators with respect to several gate libraries.
Most of our lower and upper bounds on worst-case gate counts
are tight, and rely on circuit identities summarized in Table II.
We also prove that n-qubit circuits require
$\lceil \frac{1}{4}(4^n - 3n - 1) \rceil$ CNOT gates in the worst
case.

While our techniques do not guarantee optimal circuits for
non-worst-case operators, they perform well in practice: one
run of our algorithm produced the circuit shown in Figure 4
for the two-qubit Quantum Fourier Transform. We show elsewhere
that this circuit has minimal basic-gate count.

A somewhat surprising result of our work is the apparent
asymmetry between $R_x$, $R_y$ and $R_z$ gates. While one would
expect any circuit topology for CNOT, $R_z$ and $R_y$ to carry over
to other elementary-gate libraries, we prove a negative result for
the library CNOT, $R_z$ and $R_x$. Namely, using $R_y$ gates appears
essential for the minimal universal circuit topology shown in
Figure 2, which exhibits the maximal possible number of one-qubit
gates that are not between any two CNOT gates.

The asymmetry between elementary one-qubit gates directly impacts
peephole optimization of n-qubit circuits, where decompositions like
that in Figure 2 are preferable over that in Figure 3. For example,
consider a three-qubit circuit consisting of two two-qubit blocks on
lines (i) one and two, (ii) two and three. If both blocks are
decomposed as in Figure 2 then the b gate from the first block and
the c gate from the second block merge into one gate on line two.
However, no such reduction would happen if the decomposition from
Figure 3 is used.

Circuit identities                                                                     Descriptions
$C_i^j C_i^j = 1$                                                                      CNOT-gate cancellation
$\omega^{ij} \omega^{ij} = 1$                                                          SWAP-gate cancellation
$C_i^j C_j^i = \omega^{ij} C_i^j$                                                      CNOT-gate elimination
$C_i^j R_x^j(\theta) = R_x^j(\theta) C_i^j$, $C_i^j S_x^j = S_x^j C_i^j$               moving $R_x$, $S_x$ via CNOT target
$C_i^j R_z^i(\theta) = R_z^i(\theta) C_i^j$, $C_i^j S_z^i = S_z^i C_i^j$               moving $R_z$, $S_z$ via CNOT control
$C_i^j \sigma_x^i = \sigma_x^i \sigma_x^j C_i^j$                                       moving $\sigma_x$ via CNOT control
$C_i^j \sigma_z^j = \sigma_z^i \sigma_z^j C_i^j$                                       moving $\sigma_z$ via CNOT target
$C_i^j \omega^{ij} = \omega^{ij} C_j^i$                                                moving CNOT via SWAP
$V^i \omega^{ij} = \omega^{ij} V^j$                                                    moving a 1-qubit gate via SWAP
$R_n(\theta) R_n(\phi) = R_n(\theta + \phi)$                                           merging $R_n$ gates
$n \perp m \Rightarrow S_n R_m(\theta) S_n^\dagger = R_{n \times m}(\theta)$           changing axis of rotation

TABLE II: Circuit identities used in our work. Here $V^i$ represents
an arbitrary one-qubit operator acting on wire i.

Acknowledgments and disclaimers. This work is funded 
by the DARPA QuIST program and an NSF grant. The views 
and conclusions contained herein are those of the authors and 
should not be interpreted as necessarily representing official 
policies or endorsements of employers and funding agencies. 



sides. By Corollary IIII.2I there exist operators that cannot 
be simulated without 15 one-parameter gates; the remaining 
three must go in the middle of the circuit. □ 

We have seen that for the CYZ and the CXY gate libraries, 
this lower bound is tight. We will show that this is not the 
case for the CXZ gate library. Before beginning the proof, we 
make several observations about the CXZ gate library. 

Note that conjugating a circuit identity hy H®H exchanges 
Rx and gates, and flips CNOTs. Two other ways to produce 
new identities from old are: swapping wires, and inverting the 
circuit - reversing the order of gates & replacing each with its 
inverse. For example, one may obtain one of the commutativ- 
ity rules below from the other by conjugating hy H (E)H and 
then swapping wires. 



When one CNOT gate occurs immediately after another in 
a circuit, we say that they are adjacent. When such pairs of 
CNOTs share control lines, they cancel out, and otherwise may 
still lead to reductions as discussed below. We will be in- 
terested in circuits which do not allow such simplifications. 
To this end, recall that Rx gates commute past the target of a 
CNOT, and R, gates commute past the control. Moreover, we 

:\{Rx{a)®Rmci^ 



have the following circuit identity: Cl^iRxi^) (g) R,(^))C2 



Cj{R^{^) (g) Rx{a))Ci. We say that a given collection of one- 
qubit gates effectively separates a chain of CNOTs iff there is 
no way of applying the aforementioned transformation rules 
to force two CNOT gates to be adjacent. For example, there is 
no way to effectively separate two CNOTs of opposite orienta- 
tion by a single Rx or R- gate. This is illustrated below. 



Appendix 

We now illustrate the counterintuitive difference between 
(i) the CXZ library, and (ii) libraries CYZ and CXY. Namely, 
universal circuit topologies with certain properties exist only 
for the CYZ and CXY libra ries. 

The proof of Proposition lVI. ll contains a universal generic 
circuit with three CNOT gates and 15 Ry or R^ gates with the 
property that all but three of the one-qubit gates appear either 
before the first or after the last CNOT gate. This is minimal. 

Proposition VII.l Fix an elementary- gate library. There ex- 
ist unitary operators U € SU{4) that cannot be simulated by 
any two-qubit circuit in which all but two of the one-qubit 
gates appear either before the first or after the last CNOT gate. 

Proof: There are four places where the one-parameter gates 
can appear: at the left or right of the first or second line. If 
more than three gates appear in one such place, conglom- 
erate them into a single one-qubit gate, and decompose the 
result into three one-parameter gates via Lemma III. II By 
this method, any two-qubit circuit can be transformed into an 
equivalent circuit with at most 12 one -parameter gates on its 



R^^ 



^Rx- 



On the other hand, two CNOT gates of the same orienta- 
tion can be effectively separated by a single Rx or R^ gate, as 
shown below. Up to swapping wires, these are the only ways 
to effectively separate two CNOTs with a single Rx or R,. 



^ ^ 



Proposition VII.2 At least four gates from {Rx, Rz} are nec- 
essary to effectively separate four or more CNOT gates. 

Proof: Clearly it suffices to check this in the case of exactly
four CNOTs. If three Rx, Rz gates sufficed, then one would
have to go between each pair of consecutive CNOT gates. Suppose
all the CNOT gates have the same orientation, say with control on
the bottom wire. Then the first pair must look like one of the pairs
above. In either case, we may use the identity
$C_1^2 (R_x(\alpha) \otimes R_z(\beta)) C_1^2 = C_2^1 (R_z(\beta) \otimes R_x(\alpha)) C_2^1$
to flip these CNOT gates,
thus ensuring that there is a consecutive pair of CNOT gates
with opposite orientations. As remarked above, there is no
way to effectively separate these using the single one-qubit
gate allotted them. □
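
The flipping identity used in this proof can be verified numerically. The sketch below is illustrative only. It assumes that C_j^k denotes the CNOT with control on qubit j and target on qubit k, that Rx(θ) = exp(−iθX/2) and Rz(θ) = exp(−iθZ/2), and that the first tensor factor is the top wire; all function and variable names are hypothetical.

```python
import cmath
import math

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    """Kronecker product; the first factor acts on qubit 1."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def close(a, b, tol=1e-12):
    """Entrywise comparison of two matrices up to a small tolerance."""
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def rx(t):
    """Assumed convention: Rx(t) = exp(-i t X / 2)."""
    c, s = math.cos(t / 2), -1j * math.sin(t / 2)
    return [[c, s], [s, c]]

def rz(t):
    """Assumed convention: Rz(t) = exp(-i t Z / 2)."""
    return [[cmath.exp(-1j * t / 2), 0], [0, cmath.exp(1j * t / 2)]]

# Assumed notation: C12 has control on qubit 1 and target on qubit 2,
# C21 has control on qubit 2 and target on qubit 1.
C12 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]
C21 = [[1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0], [0, 1, 0, 0]]

alpha, beta = 0.37, 1.91  # arbitrary generic angles

# Left side: Rx on the control and Rz on the target, between two C12 gates.
lhs = matmul(matmul(C12, kron(rx(alpha), rz(beta))), C12)
# Right side: the one-qubit gates trade wires, between two C21 gates.
rhs = matmul(matmul(C21, kron(rz(beta), rx(alpha))), C21)

identity_holds = close(lhs, rhs)
print(identity_holds)
```

The check passes because both sides equal exp(−iα X⊗X/2) · exp(−iβ Z⊗Z/2), and X⊗X commutes with Z⊗Z.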

Denote by $\omega^{i,j}$ the SWAP gate which exchanges the i-th
and j-th qubits. It can be simulated using CNOTs as
$C_i^j C_j^i C_i^j = \omega^{i,j} = C_j^i C_i^j C_j^i$. SWAP gates can be pushed through
an elementary-gate circuit without introducing new gates. So,
consider a two-qubit circuit in which adjacent CNOT gates appear.
If they have the same orientation (e.g. $C_1^2 C_1^2$ or $C_2^1 C_2^1$),
then they cancel out and can be removed from the circuit. Otherwise,
use the identity $C_i^j C_j^i = C_j^i \omega^{i,j}$ or $C_j^i C_i^j = C_i^j \omega^{i,j}$ and
push the SWAP to the end of the circuit. We apply this technique
at the level of circuit topologies and observe that
$Q(\mathcal{T}\omega^{i,j})$ is measure-zero (or universal) iff $Q(\mathcal{T})$ is. By the
above discussion, we can always reduce to an effectively separated
circuit before checking these properties.
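
The SWAP identities in this paragraph are statements about 4×4 permutation matrices and can be checked with exact integer arithmetic. The sketch below is a hedged illustration for qubits 1 and 2. It assumes C12 means control on qubit 1 and target on qubit 2, and the names `C12`, `C21`, `SWAP`, `matmul` and `kron` are chosen here for illustration only.

```python
def matmul(a, b):
    """Product of two square integer matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    """Kronecker product; the first factor acts on qubit 1."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

# Basis order |00>, |01>, |10>, |11>.
I4 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
C12 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]   # control 1, target 2
C21 = [[1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0], [0, 1, 0, 0]]   # control 2, target 1
SWAP = [[1, 0, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0], [0, 0, 0, 1]]

# Three alternating CNOTs give a SWAP, starting with either orientation.
swap_from_cnots = (matmul(matmul(C12, C21), C12) == SWAP
                   and matmul(matmul(C21, C12), C21) == SWAP)

# Adjacent CNOTs of the same orientation cancel.
same_orientation_cancels = matmul(C12, C12) == I4 and matmul(C21, C21) == I4

# Adjacent CNOTs of opposite orientation become one CNOT and a SWAP.
opposite_pair_reduces = (matmul(C12, C21) == matmul(C21, SWAP)
                         and matmul(C21, C12) == matmul(C12, SWAP))

# A SWAP passes through one-qubit gates by exchanging the wires they act on
# (shown here for the Pauli matrices X and Z as sample one-qubit gates).
X = [[0, 1], [1, 0]]
Z = [[1, 0], [0, -1]]
swap_pushes_through = matmul(SWAP, kron(X, Z)) == matmul(kron(Z, X), SWAP)

print(swap_from_cnots, same_orientation_cancels,
      opposite_pair_reduces, swap_pushes_through)
```

Each check corresponds to one step of the reduction: cancel same-orientation neighbours, trade an opposite-orientation pair for one CNOT and a SWAP, then move the SWAP to the end of the circuit.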

Proposition VII.3 Almost all unitary operators U ∈ SU(4)
cannot be simulated by any two-qubit circuit with CXZ gates
in which all but three of the Rx, Rz gates appear either before
the first or after the last CNOT.

Proof: We show that any circuit topology of the form above
can only simulate a measure-zero subset of SU(4); the result
then follows from the fact that a countable union of
measure-zero sets is measure-zero.

The assumption amounts to the fact that only three gates
are available to effectively separate the CNOT gates. By
Proposition VII.2 and the discussion immediately following
it, we need only consider circuit topologies with no more than
three CNOTs. On the other hand, we know from Proposition III.3
that any two-qubit circuit topology with fewer than
three CNOT gates can simulate only a measure-zero subset of
SU(4). Thus it suffices to consider circuit topologies with
exactly three CNOT gates. Moreover, we can require that they
be effectively separated, since otherwise we could reduce to a
two-CNOT circuit.

Three CNOTs partition a minimal two-qubit circuit into four
regions. We are particularly interested in the two regions bounded
by CNOTs on both sides, because single-qubit gates in
those regions must effectively separate the CNOTs. To this
end, we consider two pairs of CNOTs (the central CNOT is in
both pairs), and distinguish three cases: (1) both pairs
of CNOTs consist of gates of the same orientation, (2) both
consist of gates of opposite orientations, or (3) one pair has
gates of opposite orientations and the other pair has gates
of the same orientation. In the second case, the CNOT gates
cannot be effectively separated, since each pair of gates with
opposite orientations requires two one-parameter gates to be
effectively separated, and only three Rx, Rz gates are available.
In the third case, two CNOTs with opposite orientations must
be separated by two one-parameter gates, leaving only one Rx
or Rz to separate the pair with the same orientation. Thus,
the pair with the same orientation may be flipped, reducing to
Case 1, as shown below.

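[Figure: circuit diagrams in which the same-orientation pair of CNOTs, separated by a single Rx or Rz gate, is flipped to reduce Case 3 to Case 1.]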

Finally, consider the case in which all three CNOT gates
have the same orientation. Each pair of consecutive CNOTs
must have at least one Rx or Rz between them, to be effectively
separated. Thus one of the pairs has a single Rx or Rz between
its members, and the other has two one-qubit gates. We refer
to these as the 1-pair and the 2-pair, respectively.

Suppose that the one-qubit gates separating the 2-pair of
CNOTs occur on different lines. If either one-qubit gate can
commute past the CNOTs of the 2-pair, then it can move to the
edge of the circuit; in this case Proposition VII.1 implies that
the circuit topology we are looking at can only simulate a
measure-zero subset of SU(4) (one can show that two Rx, Rz
gates cannot effectively separate three CNOTs). Otherwise, we
use the identity
$C_1^2 (R_x(\alpha) \otimes R_z(\beta)) C_1^2 = C_2^1 (R_z(\beta) \otimes R_x(\alpha)) C_2^1$
to flip the 2-pair, so that the CNOTs of the 1-pair now have opposite
orientations. As there is only one one-qubit gate between them, this
pair is not effectively separated. For example:



[Figure: example circuit with three CNOT gates and Rx gates, illustrating the flip of the 2-pair.]



We are left with the possibility that all the CNOT gates have
the same orientation and that the 2-pair's one-qubit gates appear
on the same line. Both Rz and Rx must occur, or else we
could combine them and apply Proposition VII.1 to show that
such a circuit topology can only simulate a measure-zero subset
of SU(4). Now, if an Rx and an Rz appear on the same line between
two CNOT gates of the same orientation, then either the Rx or the
Rz can commute past one of them. If the outermost gate can commute,
Proposition VII.1 again implies that the circuit topology
simulates only a measure-zero subset of SU(4). Thus the inner
gate can commute with the 1-pair. We have now interchanged
the roles of the 1-pair and the 2-pair; thus, by the previous
paragraph, the gate which originally separated the 1-pair must
be on the same line as the commuting gate. It follows that
all gates are on the same line. Up to conjugating by H ⊗ H,
swapping wires, and inverting the circuit, this leaves exactly
one possibility.



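[Figure: the remaining circuit topology, with the Rz and Rx gates on the same line between the CNOT gates.]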



Finally, we add the four one-qubit gates on the sides,
decompose each into RxRzRx via Lemma III.1 and observe that
an Rx gate can commute across the top and be absorbed on the
other side. This leaves 14 one-parameter gates, and by Lemma
III.2 such a circuit topology simulates only a measure-zero
subset of SU(4). □
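
The count of 14 in the last step can be spelled out as follows. This is a bookkeeping sketch only; it assumes that Lemma III.2, which is not restated in this section, is the dimension-counting statement that fewer one-parameter gates than the dimension of SU(4) cannot cover more than a measure-zero subset.

```latex
\begin{align*}
\underbrace{4 \times 3}_{\text{four outer one-qubit gates, each } R_x R_z R_x}
  \;+\; \underbrace{3}_{\text{gates between the CNOTs}}
  \;-\; \underbrace{1}_{\text{absorbed } R_x}
  &= 14, \\
\dim SU(4) = 4^2 - 1 &= 15 > 14 .
\end{align*}
```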